The Role of Anchor Text in ClueWeb09 Retrieval
نویسندگان
چکیده
This report describes the work done at The University of Melbourne with the ClueWeb09 data corpus for the Web Track of TREC-2009 and TREC-2010, and for the Session Track of TREC-2010. We found that the impact-based retrieval model works well for the corpus, and that, along with some other factors, the use of an anchor text collection significantly boosts the retrieval effectiveness.
منابع مشابه
Using Anchor Text, Spam Filtering and Wikipedia for Web Search and Entity Ranking
In this paper, we document our efforts in participating to the TREC 2010 Entity Ranking and Web Tracks. We had multiple aims: For the Web Track we wanted to compare the effectiveness of anchor text of the category A and B collections and the impact of global document quality measures such as PageRank and spam scores. For the Entity Ranking Track, we use Wikipedia as a pivot to find relevant ent...
متن کاملReducing Redundancy with Anchor Text and Spam Priors
In this paper, we document our efforts in participating to the TREC 2011 Web Tracks. We had multiple aims: This year, tougher topics were selected for the Web Track, for which there is less popularity information available. We look at the relative value of anchor text for these less popular topics, and at impact of spam priors. Full-text retrieval on the ClueWeb09 B collection suffers from text...
متن کاملIncorporating social anchors for ad hoc retrieval
Anchor text has been widely used in web search as an effective complement to web page content. This motivates the investigation of similar sources of evidence about relevance. Social media postings often contain links to associated web pages, although typically not with anchor text. In this paper, we explore the use of these links and the text in the social postings as a form of anchor text (so...
متن کاملNavigation Retrieval with Site Anchor Text
In this paper we present an information retrieval system that indexes only site anchor text to verify the efficiency of reference information in a navigation retrieval task. We propose two relevancy measures to maximize limited information: reference consistency and specificity of word combination. Our results show that navigation retrieval with a site anchor text can pinpoint highly relevant d...
متن کاملEvaluation of Web Retrieval Methods Using Anchor Text
In this paper, we evaluate two types of anchor texts: a page anchor and a site anchor. Since the anchor text tends to summarize information referred ahead, it can be expected that the terms appearing there have important meaning in information retrieval. We introduce a retrieval method to give high priority to the terms in the anchor text. In the experiment, we compared the proposed method with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010